Infinite matrices may violate the associative law 
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Abstract 

The momentum operator for a particle in a box is represented by an infinite order 
Hermitian matrix P. Its square P 2 is well defined (and diagonal), but its cube 
P 3 is ill defined, because P P 2 ^ P 2 P . Truncating these matrices to a finite order 
restores the associative law, but leads to other curious results. 
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Matrices of infinite order are used in quantum mechanics for representing dynamical variables. 
It is commonly assumed that these matrices obey the usual laws valid for finite matrices. How- 
ever, it is obvious that difficulties may occasionally arise because of divergent infinite sums. We 
give here a very simple example of a Hermitian matrix P such that P 2 is well defined, but P 3 
is not, because P P 2 ^ P 2 P . 

Related ambiguities may also arise in numerical calculations, when infinite matrices are trun- 
cated to finite size (because there are no infinite computers). While the associative law is obvi- 
ously fulfilled, the difficulty emerges in some other way. In particular, we sometimes encounter 
in quantum mechanics operators which are symmetric, but are not self-adjoint (for example, 
the radial momentum operator). The representation of these operators by Hermitian matrices 
conceals their lack of self-adjointness and leads to curious results, as in the following example. 

Consider the Hilbert space of functions f(x) , with < x < n , and inner product 

(/,</>= r(x)g(x)dx. (1) 
J o 

In that space, the set of functions u m (x) = (2/ir) 1 ^ 2 sin mx , for all positive integers m, can 
be taken as a complete orthonormal basis (namely, any square integrable function of x can be 
written as a linear combination of the u m , except at a finite number of isolated points). The 
operator p = —id/dx has matrix elements 



P 



mn 



/ u m (— id/dx) u n dx, (2) 
Jo 



—4:imn/n(m 2 — n 2 ) if m + n is odd, 
if m + n is even. 

This operator, currently defined only on functions which vanish at x = and x = it , has the 
physical meaning of linear momentum of a particle in a rigid one-dimensional box. It can be 
directly verified by algebraic methods [1] that 

oo 

y ' Pms Psn inn 5 mn , (3) 

s=l 

as could be expected from 

(P 2 ) mn = [ u m (-id/dx) 2 u n dx. (4) 
Jo 

However, difficulties occur if we try to define likewise 

(P 3 ) mn = [ u m (-id/dx) 3 u n dx. (5) 
Jo 
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The result is not Hermitian. This should not be a surprise, because, with the above boundary 
conditions, the operator —id/dx is not self-adjoint [2,3]. The matrices P mn and (P 2 ) mn in 
Eqs. (2) and (4) were Hermitian, because it was then possible to perform integrations by parts, 
in which the boundary terms vanished. However, if we attempt to do the same with Eq. (5), we 
obtain 

P 7T P TV 

J o u' m u„dx = - J o u'^u' n dx+[u' m u' n ] Q , (6) 

and the last term does not vanish. 

The source of the difficulty is that the domain of P consists of functions on the interval [0, ir] 
which vanish at the interval endpoints. However, Pu n does not vanish at these endpoints, and 
therefore does not belong to the domain of P : the expression P(Pu n ) is not mathematically 
defined. This does not contradict the fact that (—d 2 /dx 2 )u n = n 2 u n . The point is that, when 
P is defined as above, P 2 is not the same as (—d 2 /dx 2 ) , notwithstanding Eq. (4). These two 
operators coincide in the common part of their domains of definition, but the domain of definition 
of P 2 is smaller. 

The issue we are investigating here is how these curious properties appear when matrix no- 
tations are used for representing differential operators (for example in a mundane numerical 
analysis). If we try to use algebraic methods for defining P 3 , we encounter the same difficulty 
in another form: 



oo oo 



ms Psn ■ (7) 

S=l 8 = 1 

Let us examine why the associative law fails. In the present case, the right hand side of 

? 00 

(-P )mn ^ ' P-mr P-rs Psn j (8) 
r,s 

contains infinitely many terms with r = s±l^m + n, which behave as ± l/ r • Therefore 
the sum of positive terms diverges, the sum of negative terms diverges, and the entire sum in (8) 
is only conditionally convergent. Its value depends on the order of summation. 

It is interesting to see how this difficulty is reflected in numerical calculations, where infinite 
matrices are truncated and replaced by finite matrices of order N (some large number). Instead 
of (8), we may try to define 

N 

Smn Z i Prar Prs Psn ; (9) 

iV-^oo — 
r.s 
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and the question is whether this sum indeed tends to a unique limit as N — > oo . In the present 
case, it does, as shown below. However, this happens only because of delicate cancellations 
between positive and negative terms, which would not occur in general. 

Consider for definiteness the case where m and s are odd, and n, r , and N are even. The 
question is whether, for N ^ m + n , the contributions of r = N and s = N — 1 to the sum in 
Eq. (9) are vanishingly small. These contributions are, apart from an overall factor 64imn/7i 3 , 

iV 2 N ~ 1 s 2 (N-l) 2 N ~ 2 r 2 

—— — y + — ^- — - — y do) 

m 2 - N 2 fr{ (N 2 - s 2 ) (s 2 - n 2 ) (TV — I) 2 - n 2 f^ 2 (m 2 - r 2 ) [r 2 - (N - l) 2 ] 1 ; 

(The second sum runs only to r = N — 2 , in order to avoid double counting of the r,s = N, N—l 
matrix element.) In the above sums, the main contribution comes from terms where r and s are 
close to N , so that the various terms are of the order of A r ~ 1 , rather than N~ 2 . We therefore 
write r = N — 2k and s = N — 1 — 2k , and we neglect terms with m 2 and n 2 , which are much 
smaller than iV 2 . The two sums in (10) become 

1 1 
S (2k + 1) (2N - 2k - 1) + S (2k - 1) (2N - 2k - 1) ' ^ 



k 



To be consistent with the preceding approximation, the sums in (11) run up to values of k 
which are much smaller than N/2 (so that both r and s are close to N). We thus obtain, 
approximately, 

2iv(~£2TTT + £2T^l) = iv(2 + £)4F^l)- (12) 

For large iV , the sums on the left hand side behave as =F log iV . However, the sum on the right 
hand side tends to — | (see ref. [1] ) so that the whole expression vanishes. In summary, the 
total contribution of the N — l and iV terms to the sum in Eq. (9) is of the order of N~ 2 , and 
that sum converges (for the particular matrix P mn under consideration). 

We have found empirically that, as iV becomes much larger than m and n (which are kept 
fixed), matrix elements S mn tends to the Hermitian part of the right hand side of Eq. (5), namely 
to 



Rmn = (i/2) 



/ u rn (d/ dxf u n dx — / u n (d/dx) 3 u m dx 
Jo Jo 



(13) 



Note that the right hand side of (13) can also be written, after integration by parts, as 
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Jo 



'n 



ii 



+ u> n ul)dx={i/2) n 
Jo 



n 2 u' m u n — m 2 u' n u m ) dx, 



[m 2 P r 



mn 



n Prim)/^ — {jm Pmn ~\~ Pmn^ )/2, 



(14) 



which is the average of P 2 P and PP 2 . The convergence was found to be of oscillatory type, as 
illustrated in Table 1, for a few randomly chosen matrix elements. 



Table 1. 


Converj 


^ence of S 


mn toward 


Pmn (the 


table lists values of 


iS mn ) . 


m 


n 


7V = 99 


iV = 100 


A^ = 999 


iV=1000 


iV=1999 


TV = 2000 


iRmn 


1 


2 


2.156 


2.088 


2.127 


2.117 


2.125 


2.120 


2.122 


2 


3 


9.828 


10.032 


9.918 


9.945 


9.924 


9.939 


9.931 


20 


31 


935.67 


959.59 


956.05 


958.89 


956.77 


958.31 


957.56 


60 


91 


5198.7 


6667.9 


8803.1 


8828.4 


8814.0 


8827.5 


8822.4 



On the other hand, truncation may also lead to counterintuitive results. For example, if we 
define 



the eigenvalues of Q mn appear in opposite pairs, A and —A (because Q and Q T = —Q have 
the same eigenvalues). The only exception occurs if ./V is odd: the eigenvalue is nondegenerate. 
Therefore the eigenvalues of Q 2 are doubly degenerate (except the null eigenvalue, if iV is odd). 
This should be contrasted with the spectrum of P 2 which is 1, 4, 9, 16, ... 

We have found empirically that the eigenvalues of Q mn are very close to integers with a 
parity opposite to that of iV . The lowest and highest eigenvalues of Q 2 are listed in Table 2, for 
iV = 999 and 1000. They appear quite different from those of P 2 . However, if we first compute 
Q 2 with iV = 1000 , and then truncate it by removing the 1000th row and column (or more rows 
and columns with the largest indices), the eigenvalues of the resulting matrix are nondegenerate 
and are very close to those of P 2 . 




m and n < N, 
m or n > N, 



(15) 
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Table 2. Some eigenvalues of Q 2 , for N = 999 and 1000. 



1 
2 
3 
4 
5 



N = 999 iV = 1000 N = 1000 

(complete matrix) (truncated to 999) (complete matrix) 



0.000000 
3.986641 
3.986641 
15.94656 
15.94656 



0.996663 
3.986641 
8.969969 
15.94656 
24.91658 



0.996663 
0.996663 
8.969969 
8.969969 
24.91658 



996 
997 
998 
999 
1000 



988294.4 
988294.4 
992673.3 
992673.3 



988294.4 
990283.2 
992673.3 
994666.5 



986110.2 
990283.2 
990283.2 
994666.5 
994666.5 



These curious results can be explained by rearranging the rows and columns of these matrices 
according to the sequence 1, 3, . . . , N — 1 , 2, 4, . . . , N — 2 or N . Since the only nonvanishing 
matrix elements (Q 2 ) mn are those whose m and n have the same parity, the matrices Q 2 
separate into two blocks. In each matrix, these two blocks have the same eigenvalues (so that 
each eigenvalue appears twice, except the null eigenvalue, if N is odd). It can be seen by direct 
inspection that the truncated matrix also consists of two blocks. Its odd-odd block is identical 
to the odd-odd block of Q^ 10 oo) > an< ^ even-even block is the same as the even-even block of 
^(999) • This explain why the same eigenvalue appears in three different positions in Table 2. 
We have also found empirically that removing additional rows and columns from Q 2 (those with 
the largest indices) improves the convergence toward the "true" eigenvalues, given by Eq. (3). 

As higher powers of P are considered, the results become curiouser and curiouser. For 
instance, P 4 = P 2 P 2 is well defined, and has matrix elements m 2 n 2 S mn , by virtue of Eq. (3). 
On the other hand, PP 2 P is ill-defined. If we write 

16 s 4 

(P P 2 P) mn = - -mn £ — —i - , (16) 

n 2 g [m 2 — s 2 ) (s 2 — n 2 ) 

where s runs over all integers with a parity opposite to that of m and n , each term in this sum 
tends to —1 when s — > oo , so that the sum is not even conditionally convergent. Moreover, 
if we introduce truncated matrices, as in Eq. (15), the sum J2QmrQrsQstQtn does not tend 
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to m 2 n 2 S mn . It diverges, for any finite m and n (with same parity), when N — > oo . The 
point is that YJ Qmr Qrs converges to ms5 ms only if s is kept fixed as N — > oo , and likewise 
for J^QstQtn ■ On the other hand, when we sum over s to obtain (Q 4 ) mn , that sum includes 
terms where s is of the order of iV and the latter have a divergent contribution. 

The conclusion to be drawn from these results is that truncation methods, which are common 
practice in quantum mechanical calculations, should be used with extreme caution when the 
truncated matrices represent unbounded operators. In the particular case discussed here, where 
the matrix P is defined by Eq. (2), the truncated matrix P 2 should be defined by squaring the 
truncated P and then deleting the last row and column (otherwise, the eigenvalues turn out to 
be completely wrong). Likewise, the truncated matrix P 3 should be defined by Eq. (9), and 
not by Eq. (5) which gives a non-Hermitian result. However, these recipes may not be valid in 
general, and other unbounded operators ought to be considered on a case by case basis. 
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